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CHAPTER 1 

UNDER-KNOTTED AND OVER-KNOTTED POLYMERS: 
UNRESTRICTED LOOPS 



Nathan T. Moore, Rhonald C. Lua, Alexander Yu. Grosberg 
Department of Physics, University of Minnesota, Minneapolis, MN 55455, USA 

We present computer simulations to examine probability distributions of 
gyration radius for the no-thickness closed polymers of A*' straight seg- 
ments of equal length. We are particularly interested in the conditional 
distributions when the topology of the loop is quenched to be a certain 
knot /C. The dependence of probability distribution on length, N, as 
well as topological state K, are the primary parameters of interest. Our 
results confirm that the mean square average gyration radius for trivial 
knots scales with N in the same way as for self-avoiding walks, where 
the cross-over length to this "under-knotted" regime is the same as the 
characteristic length of random knotting, Nq. Probability distributions 
of gyration radii are somewhat more narrow for topologically restricted 
under-knotted loops compared to phantom loops, meaning knots are en- 
tropically more rigid than phantom polymers. We also found evidence 
that probability distributions approach a universal shape at A > Aq for 
all simple knots. 

1. Introduction 

1.1. The goal of this work 

Consider a random closed polygon of some N segments, all of equal length 
£. What is the probability wtriv(^) that this polygon, considered as a closed 
curve embedded in 3D, is topologically equivalent to a circle, that is, rep- 
resents a trivial knot? What is the probability wjciN) that it represents 
a knot of any other kind, /C? Such questions arose first in the context of 
DNA-Q and other polymeria, and continue to attract significant attention 
to the present day. Although a large body of information has accumulated, 
mostly through computer simulation j I I I, final mathematical understand- 
ing of these questions remains elusive, despite their elementary formulation. 
Meanwhile, a new set of questions came to the forefront in the last sev- 
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eral years. For instance, what is the conditional probabihty density of the 
loop gyration radius given that its topology is fixed to be JC? As a first 
step, what is the average gyration radius of the loop with the Kjven knot 
state K.7 This latter question was first discussed by des Cloizcaux^and then 
re- visited theoreticallj ^'S: and computationall} !^^^^^ ^'^ -pj^g 
citement in the field is partially driven by the idea, first conjectured in the 
wor that topological constraints act effectively like self-avoidance, lead- 
ing to the non-trivial scaling (Rg) ^ A^^", where is the critical exponent 
known in the theory of self-avoiding walks, v « 0.588 « 3/5. 

The distinction between the two groups of questions can be illuminated 
by the comparison with the concepts of annealed and quenched disorder, 
well known in the physics of disordered systems (see, for instance, boolJi^ . 
If the loop is phantom, i.e. if it can freely cross itself, then its topological 
state is annealed. In this case, we can ask what the probability is to observe 
a certain topological state /C. For the loop which is not phantom and cannot 
cross itself, the knot state is frozen, or quenched, and we can discuss physical 
properties of the loop, such as its size or entropy for every given knot state 
/C. 

The main goal of this paper is to look more closely at the probability 
distributions of the gyration radius of the loops which are topologically 
constrained but not constrained otherwise. In section ??, we provide an 
overview of the previous results about the mean square averaged gyration 
radius as well as some related questions of method and simulation technique. 
We shall concentrate on the relatively simple knots, such as Oi, 3i, and 4i, 
formed by rather long polyme rs, w ith N up to 3000. Using the terminology 
introduced in the recent workl^^, we can say we shall be interested mostly 
in the under-knotted regime. This terminology makes simultaneous use of 
both annealed and quenched views of polymer topology. The idea is as fol- 
lows. Consider real polymer loop with some quenched knot /C. It is consid- 
ered over-knotted if upon topological annealing, allowing loop states to be 
sampled without topological constraints, the loop is likely to become topo- 
logically simpler than JC. Otherwise, the loop is considered under-knotted. 
Roughly, loop is under-knotted if it "wants" to have more knots, and it is 
over- knotted if it "wants" to have fewer knots. Whether a quenched loop 
is over- or under-knotted depends on the number of segments, iV, and, in 
general, on some other conditions, such as solvent quality and the like. It 
is because the loop is under-knotted that it may swell, even if there is no 
excluded volume or self- avoidance. Here, however, terminology clarification 
is in order. 
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1.2. Some terminology: non-phantom polymers and 
self- avoiding polymers are two different things 

We should first emphasize the difference between concepts of self-avoiding 
polymers and non-phantom polymers. These two concepts are quite fre- 
quently confused. The idea of self-avoidance always involves certain finite 
non-zero length scale, let say d, such that two pieces of a polymer cannot 
approach each other closer than d. For instance, if one thinks of a polymer 
as a little garden hose, then d is its diameter. Real polymers, of course, al- 
ways have some excluded volume, or some thickness d. On the other hand, 
polymers which we call phantom are imagined to be able to switch from an 
under-pass to over-pass conformations, but, importantly, neither former nor 
later state violate the self-avoidance, or excluded volume, condition. Speak- 
ing about phantom polymers, we should intentionally close our eyes on the 
process - how the polymer passes from under- to over- state. This question is 
irrelevant when we address probabilities or equilibrium statistical mechan- 
ics. In some sense, the idea of a phantom polymer can be illustrated by the 
properties of a DNA double helix in the presence of topo-II enzyme*^. Of 
course, this question of crossing mechanism becomes decisive if one wants 
to look at polymer dynamics without enzymes; for the studies of dynamics, 

the phantom model is meaningless, one should think in terms of reptation 
1 R 

insteao . 

On a more quantitative level, it is known for the polymer with N seg- 
ments of the length i and diameter d that the excluded volume effect does 
not lead to appreciable swelling as long as N < {£/d)^ (see, e.g., boolPl, 
page 91). For dsDNA at a reasonable ionic strength, this implies chain 
length up to about 2500 segments, or 75000 base pairs. In this sense, our 
testing of loops up to = 3000, although dictated by our computational 
possibilities, is also meaningful for the important particular case of DNA. 
On this length scale, it is quite reasonable to neglect the self-avoidance 
condition, and at the same time to work with the polymer which is not 
phantom, because its topological state is quenched (unless enzymes are 
present). 



2. Brief overview of our recent worl 




Our most recent work has investigated the average size of knotted loops. 
The initial focus was on those loops with trivial knot topology, denoted Oi, 
as their size has been addressed theoretically . In collecting data through 
simulation we were able to gather statistically significant information about 
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several other knots of low prime crossing number, specifically, 3i, 4i, 5i and 
52 knots. 

2.1. Simulation methods 

Like other j I I I our initial approach to the problem was to generate 
loops, compute the gyration radius for each of them 

N N 

^^-^EE4' n.=\r^~r,\ , (1) 
i=i j=i 

fi being the position vector of the joint number i, and then analyze the 
generated conformations with several topological invariants. 

Our loop generation routine is discussed in the ??. Importantly, the 
loops are generated without any relation towards their topology. When a 
loop is generated, its knot type is assigned. Therefore, we can use the en- 
semble of all generated loops to address questions regarding the annealed 
topology, such as the population fractions of various knots. At the same 
time, we are able to determine average size, and more generally, the prob- 
ability distribution of size for loops assigned any given knot type, which 
means, we can address the quenched topology questions. 

To determine loop topology we compute several topological 
invariantJ^. For the loops with N < 300, we used Alexander invariant 
A(— 1) and Vassiliev invariants of degree 2 and 3, V2 and v^. The loop was 
identified as a trivial knot when it yielded |A(— 1)| — I, V2 — 0, and = 0. 
For longer loops of N > 300, we were able to use only A(— 1) and V2 in- 
variants, assigning trivial knot status to the loops with |A(— 1)| = 1, and 
V2 — 0. The details of our computational implementation of these invariants 
are described elsewher^21_ Qf course, because of the incomplete nature of 
topological invariants, our knot assignment is only an approximation, and 
surely was sometimes in error. 

2.2. Knot population fractions 

We begin by addressing the annealed topology questions. 

Theoretically, it is believed that the probability of a trivial knot is ex- 
ponential in N: 

WtriviN)^woe^p{-N/No) , (2) 

at least, asymptotically when N ^ 1. Such exponential behavior was ob- 
served in a number of simulation works for a variety of models By now, 
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it is already considered "obvious" by physicists in the field. It is indeed fairly 

00 OA 

obvious th roug h the intuition gained by the studj (see also more 

recent worH^ and references therein) of exactly solvable model of winding 
around a point or a disc in 2D. This model shows that typical Brownian 
trajectory (that is, polymer with £ — > and N — > oo) tends to produce 
a diverging winding angle, that is, an infinite number of turns around the 
point-like obstacle. It does not seem to require a particularly great leap of 
imagination to conclude that at very large N some finite scale knots should 
be formed with a non-zero frequency everywhere along the polymer - and 
this exactly leads to Poisson-like exponential formula l|2Jl. 

With regard to the probabilities of other non-trivial knots, it may be 
argued that they should also be asymptotically exponential 

WK (N) = 4'=) exp i-N/N^) , (3) 

and, moreover, that characteristic length should be the same as that for 
trivial knots: Njc = Nq. This latter idea can be understood by saying that 
for every knot, the loop must eventually become strongly under-knotted if 
iV increases without bound while knot is quenc hed. F ormula (j2Jl was also 
tested, albeit b y a smaller number of simulation J^ l ^^i 

In the worli^, we fit formula (0) to our trivial knot data and found 
critical length, Nq = 241 ± 0.6 and wq = l.OTiO.Ol. This value of A^o is 
consistent with the result on a rod-bead mo deP ^^ in the limit of excluded 
volume radius sent to zero. In other work^El somewhat larger values of 
Nq were reported, closer to 300 or 330. We interpret this discrepancy as 
being due to the fact that w e ex amined the model with all segments of the 
same length, while the work^El dealt with Gaussian distributed segments. 
We consider it an exciting challenge to understand why these two models 
exhibit differing values of characteristic knotting length. 

Figure ?? shows our simulation data for trivial knots fraction, along 
with the data illustrating the relative frequency of other knot types. To the 
accuracy of our simulations, we do not see all non-trivial knot probabili- 
ties decaying with the same characteristic length Nq. However, we tried to 
determine Njc (see equation (jSJ) by fitting the data over sliding window. 
For instance. Table ?? shows the fit parameters obtained on the interval 
500 < TV < 1150, or on the interval on 1150 < N < 3000. It is clearly 
seen that "apparent" characteristic length decreases. Although far from 
proof, this result is consistent with the theoretical argument behind for- 
mula Q and allows one to hypothesize that the asymptotics is just very 
slowly achieved. 
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Table 1. Characteristic Lengths, Nic 
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Fig. 1. The fraction of loops generated with trivially knotted topology followed the well 
known exponential form, equation as a function of loop length A^. Deviation from 
the fit line at large N is due to the incompleteness of topological invariants employed and 
reflects contamination of the supposedly trivial pool with some non-trivial knots. The 
fractional population curves for several different simple knot types are shown and labeled. 
Although their overall decay can be reasonably fit by exponents, the characteristic lengths 
Nfc appear larger than A^o i which probably means that true asymptotics are very slowly 
achieved. 



2.3. Average size of different knots 
2.3.1. Scaling of the trivial knot size 

When averaged over all loops, the mean square gyration radius, (Rg), is 
equal to N£'^/12, which is two times sm aller than the similar quantity for 
linear chains (sec, for instance, boolfl^ see also ??). As regards av- 
eraged over only trivially knotted loops, the theoristJ^^^El predicted, that 
trivial knots develop swelling behavior for N 3> No, in a way similar to 
objects which experience excluded volume forces; 

,2\ . _ / (^Vl2) N if iV < A^o 



(i?.,)triv- |^^^2/i2)A^2. if , (4) 



February 2, 2008 7:4 



WSPC/Trim Size: 9in x Gin for Review Volume 



04-02-29'color 



Under- Knotted and Over-Knotted Polymers: 1. Unrestricted Loops 7 

where scaling power is v ^ 0.589, and where Nq is the same parameter 
introduced in formula 1^)1. 

We want to emphasize here that the first line of the prediction, formula 
, is not connected to any delicate and thus possibly unreliable theoretical 
arguments, but rather comes out of almost pure common sense. Indeed, 
when N <Si Nq, according to formula l(2Jl, there is only marginal probability 
for a phantom loop to have any knot other than trivial. This means, the 
ensemble of trivially knotted loops at these N very nearly coincides with 
the ensemble of all loops, for which (i?^) is certainly equal to Ni'^/ 12. 

Figure (??) demonstrates how our simulation results are consistent with 
formula First of all, we see that indeed the Gaussian scaling (i?^) = 
NP /12 is recovered at N below Nq. Fitting the data over the interval 500 < 
N < 2500, we found the parameters, v « 0.58 ± 0.02 and A « 0.44 ± 0.03. 
It is not only important that v is consistent with expectations, it is also 
important that the value of pre-factor A provides for smooth cross-over 
between regimes at N very close to iVg, as expected (because A is very 
close to Nl'^" « 0.42). 

2.3.2. Corrections to scaling 

Can one pull the analogy between t rivial kno ts and self-avoiding polymers 
further? The temptation in the has been to fit the trivial knot 

data with a more complex perturb atio n formula, motivated by the analogy 
with the excluded volume problerr^S 



\ a' 12 



(5) 



To understand this formula, it is useful to recall its appearance in the 
better known context of the excluded volume problem (here, we re-phrase 
presentation in the book'T^. For the excluded volume (or self-avoiding) 
polymer, one first shows that gyration radius can be written in the form 
(Rg) = N£'^f{x), where f{x) is a universal function of the argument x = 
{d/i)^/N (where d and i are segment thickness and length, respectively). 
For our purposes here, we denote Nq = {i/d)'^ > 1 and then write x = 
{N/N^f^. Whe n cc is small, x <C 1, then f{x) can be presented as an 
(asymptotic) perturbation series in integer powers of x. When x is large, 
X 3> 1, the leading term in f{x) contains the non-trivial scaling power: 
f{x) ~ x^""^, and then the correction terms in this large x asymptotics 
involve negative powers of x, in most cases believec^^to be integer negative 
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powers: f{x) ^ x^"'^ [l + B/x + C/x'^ + . . Using this formula to write 
(Rg) in terms of Nq, we obtain exactly the equation (with A = 1/2). 

This consideration shows that for the excluded volume problem formula 
(0) is only valid at a; 3> 1, or 3> Nq. It is not an interpolation formula 
valid across the cross-over region x ^ 1; it does not connect two asymptotics 
smoothly. For the latter reason, it cannot be considered an interpolation 
for trivial knots. That is why we think it is not correct to fit the simulation 
data to this formula in the range of N other than N ^ Nq (or at least 

N > Nq). 

Unfortunately, our data do not allow for reasonable fit to this formula 
even in the range N > Nq. The reason is seen in the fact that our data 
represent a curve which seems to keep bending upwards as N increases, 
while formula Q implies saturation of the log-log slope to that dictated by 
the power 2iy. A mechanical attempt to fit the formula to the data yields 
physically meaningless values for v which are greater than unity. 

Currently we do not know why data do not fit formula ©. One reason 
may be simply poor statistics and noisy character of data at large A''. It 
might also be an indication of the knot pool contamination at large N 
because of the incompleteness of topological invariants. This is possible, 
but, in our opinion, not very likely given that trivial knot fraction does not 
deviate much from the exponential fit (see Figure ??). In the work , we 
attempted to address this question deeper, introducing the correction for 
the errors in knot assignment. It did not yield much change in terms of (Rg), 
making us a bit more confident that the problem might be somewhere else. 
For instance, it is possible that the formula ((Sj docs not apply to trivial 
knots, indicating some restricted applicability of the very analogy between 
trivial knots and excluded volume polymers. Much work will be necessary 
to clarify this issue. 



2.3.3. Averaged sizes of non-trivial knots 

Our measurement of the swelling of non-trivial knots is shown in figure 
(??). It is overall consistent with findings by earlier wor kW We find 
that the simple knots cross over from an over-knotted state, in which they 
are much smaller than the average sized loop to an under-knotted state in 
which they seem to approach the scaling of trivial knots in an asymptotic 
fashion. The inset in this image shows this asymptotic approach in the form 
of a small parameter, f3 — 1 — {Rl)jc/ {Rl)o decaying with increasing N. 
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Fig. 2. Gyration radius averages over trivially knotted loops. The trivial knot average 
exhibits power law behavior at large similar to that experienced by polymers which 
have excluded volume. The trivial knot data is systematically larger than the average 
over all loops, shown as the solid line in the figure. This topology driven swelling is s een 
to develop beyond the critical length about A^o = 241. Independently collected datJ^^l 
is shown by stars (*) and agrees with our results. 
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Fig. 3. Log-log plot of the mean square gyration radius, {Rg):ct of knot type K, nor- 
malized by the topology blind average over all loops for several particular knot types. 
The inset, which shows the ratio of a particular knot gyration radius to the trivial knot 
gyration radius, 1 — {Rg)/^/ {Rg)ot demonstrates that all knots remain smaller than, but 
approach the size of, trivial knots. 
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3. Probability distributions of the loop sizes 

Our data allow us to make one more step and to look not only at the aver- 
aged value of i?g for trivial and some non-trivial knots, but also at the entire 
probability distributions. We were able to generate and analyze histograms 
of quality (i.e. looking smooth when plotted, a minimum of 10^ loops for 
each curve) for loops of size N < 1200. Predictably, the probability dis- 
tributions are different for different topological classes, such as all loops 
versus loops of a certain knot type /C. Also predictably, the probability dis- 
tributions of spread out as N increases. The latter observation suggests 
the idea of looking at the probability distributions of the re-scaled variable 
p — Rg/{R'^), where the normalization factor (i?^) is taken separately for 
each TV and for each topological entity. 

Our main findings are summarized in figures (??), (??), and (??), where 
we present probability distributions P{p) for the trivial knots Oi (o), trefoils 
3i (A), and 4i knots (□). In the same figures we plot also for comparison 
the analytically computed probability distributions for linear chains and 
for all loops. For linear chains, the necessary distribution Pchain(p) was 
found by Fixman a long time ag as described in ??, we were able to 
derive a similar expression for the probability distribution over all loops, 
irrespective of topology. To avoid overloading the figures, we do not show 
the corresponding data points obtained for linear chains and for all loops, 
but they all sit essentially on top of the theoretical curves (confirming once 
again the ergodicity of our loop generation routine) . 

Comparing the shapes of probability distributions for all loops and those 
with identified quenched topology, we notice that the latter distributions are 
somewhat more narrow. We emphasize, that although the effect looks small 
for the eye, it is well above the error bars of our measurements. This means 
simple knots are less likely to swell much above their average size than other 
knots, and they are also less likely to shrink far below their average, again 
compared to other knots. Figures ?? and ?? show this stiffness in both 
large and small limits of p. In the region p < 1.25 the general notion that 
entropic stiffness goes with topological complexity seems to hold true, i.e. 
more complex knots are more difficult to stretch or compress than arbitrary 
loops of the same number of segments. That the opposite of this seems to be 
true in the large p region is a subtlety not yet fully understood. In any case, 
topology blind loops are by definition always more flexible than topology 
specific loops. 

The small p limit is of particular interest given its relation to all prob- 
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lems involving collapsed polymers, such as proteins. A closer view of the 
small Rg region of the probability distribution is presented in the Figure ??. 
There, the probability distributions are plotted in the semi-^o^ scale against 
p and, in the inset, against 1/p. This can be also understood as the plot 
of "confinement" entropy, which corresponds to the squeezing the polymer 
to within certain (small) radius. The reason why we plot the data against 
1/p is because both Pchain(/o) and Pioops(p) at small p have asymptotics 
~ exp (— const/p) (see formulae (|B.12ll and (jB.lSp ). which corresponds to 
confinement entropy ~ and which can be established by a simple scal- 
ing argument, as described, e.g., in the boolPl (page 42). This 1/p behavior 
is seen clearly in Figure (??). Furthermore, we see indeed that compressing 
any specific knot, trivial or otherwise, is significantly more difficult than 
compressing a phantom loop. Analytical expression of entropy for knots 
is not known, thus far only the i?^"'' ^ p^^l"^ scaling at small p has been 
conjecturedl22l. Although our data is qualitatively consistent with this pre- 
diction in terms of the direction of the trend, more data is needed for 
quantitative conclusion. 
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Fig. 4. The probability density plot for chains^Sl (line), all loops (another line), and 
loops with certain knots (Oi - o, 3i - A, 4i -□) in the range of large p > 1. Distributions 
are presented in terms of the scaling variable p = R^/ {Rg)- The asymptotics calculated 



in ??, equations IB. 121 and IB. 1.31 . are shown in the figure as dashed lines. 



More detailed comparison of probability distributions for different knots 
and different N are presented in the Figure ??. This figure shows a num- 
ber of different probability curves under different conditions. The left col- 
umn of this figure compares the topology of different objects while holding 
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Fig. 5. The probability density plot for chains^SI (line), all loops (another line), and 
loops with certain knots (Oi - o, 3i - A, 4i -□) in the range of small p < 1. Distributions 
are presented in terms of the scaling variable p = Rg/(Rg). The asymptotics calculated in 
??, equations IB. 121 and IB.13i . are shown in the figure as dashed lines. Inset: Semi-log 
probability density plot (or linear entropy plot) at small p against 1/p. 

the length of the objects constant. The right column of the same figure 
shows comparisons of different lengths of the same topology. Significant 
in Figure (??) is the suggestion that probability distributions for differ- 
ent knots become very similar if not identical with increasing N. Indeed 
in the left column in Figure ?? it is difficult to see the difference between 
the distributions for the three distinct topologies for N = 1200 or even for 
N = 660. One way t o understand this effect is to consider the notion of knot 
localizatiorP^^I^22l xhe idea is that every strongly under-knotted loop at 
large iV places its knot in some small fraction of its length, thus looking like 
a trivial knot, with a small bump where the appropriate crossings reside. 
The collapse of -Pac(p) for different, simple knot types, JC, to one curve at 
large N is consistent with this concept of localization. At the same time. 
Figure ?? suggests that probability distribution Pjc{p) for each knot keeps 
evolving with N changing over the cross-over region at N ^ No. 

4. Concluding remarks 

To summarize, in this paper we presented computational results on knots in 
zero thickness loops of N rigid segments of equal length £. To the accuracy of 
our measurements, our data are consistent with the idea that mean square 
gyration radius averaged over the loops which are topologically equivalent 
to trivial knots is larger than the similar quantity averaged over all loops 
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p(p) compare topology P(p) compare lengths 




0.5 1 1.5 p 0.5 1 1.5 




0.5 1 1.5 P 0.5 1 1.5 P 
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Fig. 6. The probability distributions, P(p) for several different lengths, N. Left Col- 
umn: Collapse of several different topologies to one curve at large A^, (compare = 660 
or 1200 to Af = 90), implies that one master curve for under-knotted loops exists, and 
that it is visible for Oi, 3i, and 4i knots at Af > 660. Right Column: Curves for these 
simple topologies, as they differ in length, are certainly more similar to each other than 
they are to the average of all loops. Movement of the curves as N changes is not yet 
understood. 

irrespective of topology. The extent of this additional swelling appears sim- 
ilar to the swelling of self-avoiding walks compared to Gaussian random 
walks. Swelling is characteristic not only of trivial knots, but in general for 
under- knotted loops, in the sense that a topologically quenched loop swells 
if its knot state would have simplified upon annealing of its topology. We 
have examined not only averaged gyration radius, but also its probability 
distribution. We found that topologically under-knotted loops are relatively 
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unlikely to deviate far from their average sizes, either to smaller or to larger 
sizes. We have also found indication that the probability distribution of the 
gyration radius of simple knots becomes universal for all under-knotted 
loops when their length exceeds certain threshold. Importantly, our data 
confirm the existence of a cross-over at TV of the order of A^O; the char- 
acteristic length of random knotting: it is only at N > Nq that there is 
analogy between under-knotted loops and self-avoiding walks, at TV < A^o 
topological constraints have only a marginal effect on the trivial knots. 

How far does the analogy go between self-avoiding polymers and topo- 
logically constrained ones? We were unable to confirm this analogy beyond 
simple scaling; it is unclear whether the (i?^) dependence on N approaches 
its scaling form N'^'^ in the same manner as it happens for self-avoiding 
walks. It is worth emp has izing that there is a field theoretic formulation for 
the self-avoiding walkJ23 but there is nothing of this sort for knots. In our 
opinion, it remains an exciting challenge to find a solid understanding of 
the connection between fluctuation properties of the loop and its topology. 
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Appendix A. Loop generation 

We generated loops of the length N divisible by 3 using the following 
method. To produce one loop, we generated N/3 randomly oriented equi- 
lateral triangles of perimeter 3£. We consider each triangle a triplet of head- 
to-tail connected vectors. Collecting all N vectors from triangles, we 
re-shuffled them, and connected them all together, again in the head-to-tail 
manner, thus obtaining the desired closed loop. 

A similar simpler method applicable for even N and re-shuffling vectors 
obtained from zero sum pairs often yields the loops with overlapping nodes. 
This happens when the re-shuffling results in the succession of some 2m < 
N vectors belonging to exactly m pairs and thus forming the zero sum (i.e., 
closed) sub-loop. The probability of such an event is of order unity, because 
the probability for the two vectors from the same pair to be next to each 
other after the re-shuffli ng s cales as and there are ^ N such pairs; 

more accurate calculation^ shows that this probability approaches 1 — 1/e 
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Fig. 7. The fraction of generated loops which overlapped within the resolution of com- 
putational accuracy. If a set of N/s polygons, where polygons have s sides, is used to 
generate a walk of length N, the fraction of generated loops, ~ N^~'' will overlap exactly. 
This behavior is seen in the image. 

as — > oo. 

For the triangles, the problem is not in any way as severe, because the 
probability for the three vectors of the triplet to be next to each other scales 
as while the number of triangles is still ~ iV, so the overlapping loops 

are rare as 1/iV (and the probability to have two, or, in general, m triplets 
to occupy completely the 3m stretch of the re-shuffled sequence does not 
change the estimate). 

Our test measurements of the fraction of loops overlapping generated 
with pairs, triplets, and pentagons of vectors (squares are 2 pairs), shown 
in figure (??), agree with this understanding. We see in this figure that 
the fraction overlapping at a certain N, when generated in polygons of s 
edges scales like A^^"*. We chose to generate with triplets to avoid the con- 
stant overlap implied by pairs, as well as avoiding the correlation implicit 
with larger sets of objects. Although generated with our method, these 
loops are not members of the set analyzed as they are not single stranded 
loops devoid of self-intersections, but rather a different physical class of 
objects with "petals." The simple example of = 9, see figure (??), illus- 
trates this. Suppose that the three triangles generated have segment vec- 
tors, (oi, 02, 03), (61, 62, ^3) and (cl, C2, C3). By definition, each set of vectors 
within a triangle sums to zero, for example, cl -|- C2 + C3 = 0. A walk is then 
created by a random permutation of all of the segment vectors, for exam- 
ple, (03, C2, oi, &i, 02, &3, Cl, &2, C3). The problem of overlapping, described 
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above, occurs whenever the elements of one or more complete triangles oc- 
cur within a continuous section of the permutation vector. This subsection 
forms a complete loop, as does the rest of the chain and instead of a single 
loop, one has a diagram which looks something like a flower with multiple 
petals coming off of a center axis or set of axes. 



Fig. 8. Our generation routine can produce errant objects which are not loops. 

In practice, there was also a totally different problem. At large N, our 
knot identification routine was sometimes failing because of the perceived 
triple crossing on the projection. A simple rotation by random Euler angles 
resolved this projection problem in all cases. 

Appendix B. Probability distribution of all loops 

In this Appendix we address the problem which, of its own, does not belong 
to the subject of knots. Namely, we consider a phantom loop, which can 
freely pass through itself, and determine the probability distribution of its 
gyration radius. In other, equivalent, words we consider the distribution of 
sizes over the ensemble of all possible loops of the given number of segments, 
N, irrespective of their topology. Our approach here closely follows that of 
the wor iPI by Fixman, where he determined probability distribution for 
the gyration radius of the linear chains. To make our work self-contained, 



(1 ) Generate triangles, considering edges 
to be a vector set with zero sum. 



(2) Randomly permutate the edges, 
ordering in head-tail style. Result is a 
loop or a set of petals. 
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we reproduce below the main steps of Fixman derivation along with our 
results for phantom loops. 

To begin with, we simplify the problem by transforming it from the 
gyration radius of a chain or a loop with N rigid segments of fixed length 
£ to the similar problem with a smaller number of Gaussian distributed 
segments. To achieve this, we group N segments in n blobs of N/n segments 
each. We denote as bffk the end-to-end vector of each blob labeled k, where 
6^ = {N/n)£'^. Note that mean squared gyration radius, which is well known 
for both chains and loopJEI can be expressed in terms of either N and £ or 
n and b: {Rl)chi,in = N£^/6 = nh'^/Q for chains and (ii^ioop = N£'^/12 = 
for loops. 

If N/n ^ 1, then probability distribution for the unitless vec- 
tor fj is Gaussian, with zero mean and unit variance: g{if} = 
(3/277)"^^^ exp (—377^/2). If, at the same time, n ^ 1, then computing the 
gyration radius 1^ we can replace each blob with the concentrated mass 
N/n sitting, say, at the beginning segment of this blob. Then, formula 

can be transformed to have just n (instead of N) points, where now 
rij — b J2k=i ■ Accordingly, the gyration radius can be expressed as a 
quadratic form of the vectors eta. It is convenient to write it in the form 



where coefficient A is different for chains and loops and can be determined 
at the end to ensure the correct average {{p) = 1), and where kernel G{k, m) 
is as follows: 



We now note that the probability of the chain conformation specified 
by blob end-to-end vectors bffi , 6772 , • • ■ , bffn is given by 




(B.l) 



G(k, to) = -^H(m -k) + -^H(k - m) ; H{x) 

n-^ 



1 for x > 



1/2 fora; = 
for X < 



(B.2) 



^chain({??}) = n diVk) ■ 



(B.3) 



fe=l 



Similar probability for the loop reads 




(B.4) 
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Compared to the distribution for the chains, we have here one more factor g, 
describing the connection between chain head and tail, making the loop; we 
have (5-function ensuring loop closing; and we have also the normalization 
factor. 

Now, in order to compute probability distribution of p, we introduce 
the characteristic function 



K{s) = (e^'"') = / e'P'Z{{Tj})d{iJ} 



(B.5) 



where Z is either .^chain or Z\oo^- Looking at the expressions for Z, (|B.3|I 
or (jB.4p . and for p, (jB.ljl . we see that the three Cartesian components 
of vectors decouple. Taking advantage of this decoupling, we can write 
K{s) = where 



/chain 



27r 



(n-l)/2 



exp 



IS A G{k,m)'rikf 



for chains, and 

/loop(s) 



fc.m— 1 



V2 



fc=i 

dr]idri2 . - .drin-i 



(B.I 



1/2 



n 2 n — 1 

. k=l k=l 



n-1 



isA G{k,m)rik 



rim 



k,m—l 



dr]idri2 ■ ■ ■ drjndp 



(B.7) 



for loops. In the later case, we have used the integral representation of 
the (5-function, thus the extra integration over p. These Gaussian integrals 
are easy to eva lua te, because the matrix G{k, m) is diagonalized, (we have 
omitted detailJ^^, by the unitary matrix C{k,m) — \/2/n sin (7rfcm/n) , 
revealing the eigenvalues of the G matrix, l/fc^7r^ with all integer k from 1 
to n. Upon some algebra, we obtain for chains 

/chain(s) = 11 I 1 - 



k=l 



no 



fe=i 



3fc27r2 



sin z/ Z 



-1/2 



(-^) 

V sm z / 



1/2 



(B., 
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where = 2iAs/3. Similar manipulations for loops involve an extra inte- 
gral over p: 



exp 



2n \ - 



3 ^ 7r2fc2 - z2 

fc=l 



dp : 



(— ) 



1/2 



7r2(2m+ 1)2-^2 

m=0 ^ ' 



tan(z/2)/4z 



-1/2 



z/2 



sm{z/2) 



(B.9) 



where again z2 = 2iyls/3. Finally, we choose coefficient A based on the 
condition (p) = 1, or K'{s)s=o = t. This yields A = 6 for chains and 
A — 12 for loops. Therefore, we finally get 



-fS^chain(s) = (sinz/2;) 



(the result due to FixmarPS), and 



z^ = 4js , (B.IO) 

/^ioop.(s) = (2sin(z/2)/z)-" , z2 = 8is . (B.ll) 

Knowing K{s), finding the probabihty distribution P{p) is the matter of 
inverse Fourier transform. Numerical inversion of Fourier transforms yield 
the curves presented in the Figures ??, ??, and ??. 

Analytically, asymptotic expressions can be found for both small and 
large p. For chains, FixmarP^ found 



9 /6„-3p-9/(4p) 



-Pchain(/3) 

■ ■■ ' —p "e 
Similar expressions for loops read 



for p < 1 



^'loop(p) 



for p > 1 

324^/§p-9/2e-9/(2rt for p « 1 



(B.12) 



(B.13) 



To obtain these results, it is convenient to re-write the inverse Fourier trans- 
form: 



1 

2^ 



1 
2Tn 



dC 



(B.14) 



where in the latter integral (. — z/2 and integration contour V in complex 
C-plane is V-shaped, runs from infinity along the line with argument 37r/4 
to infinity along the line with argument 7r/4. In this form, it is conveniently 
seen that Pioop(p) = at p < 0, as it must be, since p is a positive quantity. 
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Furthermore, deforming the integration contour, we can estabhsh that at 
p <C 1 the integral is dominated by the saddle at C ~ 3i/p, while at p 3> 1 
it is dominated by the residue at the third order pole in ^ = tt, yielding the 
results (|rT31i . 

On a more physical note, it is important to realize that the exponential 
terms in equations (|B.12|I and (|B.13I) at small p are identical if written in 
terms oi Rg, N and £ instead of p. Indeed, the leading term of the corre- 
sponding entropy (which is — InP) is equal to 9N£^ /24Rg for both chains 
and loops. Apart from the coefficient of 9/24, the scaling form of this result 
can be understood based on a simple argument considering confinement of 
either a chain or a loop in a cavity of the size R ^ £^/N (see, for instance, 
booklSl, formula (7.2)). 

On the other hand, at large p chain entropy is 3{TTRg)'^ /2N£'^, while 
loop entropy is four times larger, it is 6{TTRg)'^ /Ni'^ . This can be under- 
stood as follows. For the chain, remembering that entropy of the state with 
end-to-end distance L is 3L^/2iVi?^, Fixman noted EHI that large Rg con- 
formations are dominated by the semi-circular shapes with L = nRg. The 
loop obviously represents two such pieces, so loop entropy is twice entropy 
of the half-chain: 6{nRg)^/Ne = 2 x {3{nRg)^ /2{N/2)e) . 
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